Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 23921 |
| Missing cells | 126580 |
| Missing cells (%) | 21.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 4.4 MiB |
| Average record size in memory | 193.0 B |
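The missing-cell percentage can be cross-checked from the counts in the table: the dataset has 25 × 23921 cells, of which 126580 are missing. The sketch below is only that arithmetic in Python. The variable names are illustrative, and reading "Average record size" as total memory divided by the number of observations is an assumption that happens to agree with the reported figures.

```python
# Figures copied from the Dataset statistics table
n_variables = 25
n_observations = 23921
missing_cells = 126580

# Every observation has one cell per variable
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
print(f"{total_cells} cells, {missing_pct:.1f}% missing")

# Assumed definition: total memory (4.4 MiB) spread evenly over all records
total_bytes = 4.4 * 1024 ** 2
avg_record_bytes = total_bytes / n_observations
print(f"about {avg_record_bytes:.0f} B per record")
```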
Variable types
| Categorical | 9 |
|---|---|
| Numeric | 14 |
| Boolean | 2 |
Alerts
| Alert | Type |
|---|---|
| date has a high cardinality: 5550 distinct values | High cardinality |
| home_team has a high cardinality: 211 distinct values | High cardinality |
| away_team has a high cardinality: 211 distinct values | High cardinality |
| tournament has a high cardinality: 82 distinct values | High cardinality |
| city has a high cardinality: 1576 distinct values | High cardinality |
| country has a high cardinality: 217 distinct values | High cardinality |
| home_team_fifa_rank is highly correlated with away_team_fifa_rank and 6 other fields | High correlation |
| away_team_fifa_rank is highly correlated with home_team_fifa_rank and 6 other fields | High correlation |
| home_team_total_fifa_points is highly correlated with home_team_fifa_rank and 5 other fields | High correlation |
| away_team_total_fifa_points is highly correlated with away_team_fifa_rank and 6 other fields | High correlation |
| home_team_goalkeeper_score is highly correlated with home_team_fifa_rank and 4 other fields | High correlation |
| away_team_goalkeeper_score is highly correlated with away_team_fifa_rank and 4 other fields | High correlation |
| home_team_mean_defense_score is highly correlated with home_team_fifa_rank and 4 other fields | High correlation |
| home_team_mean_offense_score is highly correlated with home_team_fifa_rank and 4 other fields | High correlation |
| home_team_mean_midfield_score is highly correlated with home_team_fifa_rank and 5 other fields | High correlation |
| away_team_mean_defense_score is highly correlated with away_team_fifa_rank and 4 other fields | High correlation |
| away_team_mean_offense_score is highly correlated with away_team_fifa_rank and 4 other fields | High correlation |
| away_team_mean_midfield_score is highly correlated with away_team_fifa_rank and 4 other fields | High correlation |
| away_team_continent is highly correlated with home_team_continent and 1 other field | High correlation |
| tournament is highly correlated with home_team_continent and 6 other fields | High correlation |
| home_team_continent is highly correlated with away_team_continent and 1 other field | High correlation |
| neutral_location is highly correlated with tournament | High correlation |
| away_team_score is highly correlated with home_team_result | High correlation |
| home_team_result is highly correlated with away_team_score | High correlation |
| home_team_goalkeeper_score has 15542 (65.0%) missing values | Missing |
| away_team_goalkeeper_score has 15826 (66.2%) missing values | Missing |
| home_team_mean_defense_score has 16134 (67.4%) missing values | Missing |
| home_team_mean_offense_score has 15411 (64.4%) missing values | Missing |
| home_team_mean_midfield_score has 15759 (65.9%) missing values | Missing |
| away_team_mean_defense_score has 16357 (68.4%) missing values | Missing |
| away_team_mean_offense_score has 15609 (65.3%) missing values | Missing |
| away_team_mean_midfield_score has 15942 (66.6%) missing values | Missing |
| home_team_total_fifa_points has 14290 (59.7%) zeros | Zeros |
| away_team_total_fifa_points has 14288 (59.7%) zeros | Zeros |
| home_team_score has 6273 (26.2%) zeros | Zeros |
| away_team_score has 9558 (40.0%) zeros | Zeros |
Reproduction
| Analysis started | 2022-10-09 15:05:18.916817 |
|---|---|
| Analysis finished | 2022-10-09 15:05:50.122228 |
| Duration | 31.21 seconds |
| Software version | pandas-profiling v3.3.0 |
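The reported duration is the difference between the two timestamps in the Reproduction table. A minimal check with Python's standard `datetime` module (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction table
started = datetime.fromisoformat("2022-10-09 15:05:18.916817")
finished = datetime.fromisoformat("2022-10-09 15:05:50.122228")

# Subtracting two datetimes yields a timedelta
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```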
date
| Distinct | 5550 |
|---|---|
| Distinct (%) | 23.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| 2012-02-29 | 66 |
|---|---|
| 2016-03-29 | 59 |
| 2008-03-26 | 59 |
| 2014-03-05 | 57 |
| 2022-03-29 | 55 |
| Other values (5545) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 239210 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
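These character statistics are consistent with every value being a 10-character ISO date made of eight digits and two hyphens. The sketch below classifies the characters of one sample value with Python's standard `unicodedata` module and scales the result to the full column. Using `unicodedata` here is an assumption for illustration, not a statement of how the report computes its categories.

```python
import unicodedata
from collections import Counter

sample = "1993-08-08"  # first sample row of the column
n_observations = 23921

# Unicode general categories: 'Nd' = Decimal Number, 'Pd' = Dash Punctuation
categories = Counter(unicodedata.category(ch) for ch in sample)
print(dict(categories))

# If every value has the same shape, column totals follow by scaling
total_characters = len(sample) * n_observations
decimal_numbers = categories["Nd"] * n_observations
dashes = categories["Pd"] * n_observations
print(total_characters, decimal_numbers, dashes)
```

The three scaled totals agree with the character counts reported for this column (239210, 191368 and 47842).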
Unique
| Unique | 1934 |
|---|---|
| Unique (%) | 8.1% |
Sample
| 1st row | 1993-08-08 |
|---|---|
| 2nd row | 1993-08-08 |
| 3rd row | 1993-08-08 |
| 4th row | 1993-08-08 |
| 5th row | 1993-08-08 |
Common Values
| Value | Count | Frequency (%) |
| 2012-02-29 | 66 | 0.3% |
| 2016-03-29 | 59 | 0.2% |
| 2008-03-26 | 59 | 0.2% |
| 2014-03-05 | 57 | 0.2% |
| 2022-03-29 | 55 | 0.2% |
| 2012-11-14 | 55 | 0.2% |
| 2011-10-11 | 54 | 0.2% |
| 2011-11-11 | 54 | 0.2% |
| 2011-11-15 | 53 | 0.2% |
| 2011-09-02 | 52 | 0.2% |
| Other values (5540) | 23357 |
Words
| Value | Count | Frequency (%) |
| 2012-02-29 | 66 | 0.3% |
| 2008-03-26 | 59 | 0.2% |
| 2016-03-29 | 59 | 0.2% |
| 2014-03-05 | 57 | 0.2% |
| 2012-11-14 | 55 | 0.2% |
| 2022-03-29 | 55 | 0.2% |
| 2011-10-11 | 54 | 0.2% |
| 2011-11-11 | 54 | 0.2% |
| 2011-11-15 | 53 | 0.2% |
| 2011-09-02 | 52 | 0.2% |
| Other values (5540) | 23357 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 60867 | |
| - | 47842 | |
| 1 | 38674 | |
| 2 | 35093 | |
| 9 | 15929 | 6.7% |
| 6 | 9098 | 3.8% |
| 3 | 7542 | 3.2% |
| 7 | 6383 | 2.7% |
| 8 | 6307 | 2.6% |
| 5 | 5981 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 191368 | |
| Dash Punctuation | 47842 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 60867 | |
| 1 | 38674 | |
| 2 | 35093 | |
| 9 | 15929 | 8.3% |
| 6 | 9098 | 4.8% |
| 3 | 7542 | 3.9% |
| 7 | 6383 | 3.3% |
| 8 | 6307 | 3.3% |
| 5 | 5981 | 3.1% |
| 4 | 5494 | 2.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 47842 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 239210 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 60867 | |
| - | 47842 | |
| 1 | 38674 | |
| 2 | 35093 | |
| 9 | 15929 | 6.7% |
| 6 | 9098 | 3.8% |
| 3 | 7542 | 3.2% |
| 7 | 6383 | 2.7% |
| 8 | 6307 | 2.6% |
| 5 | 5981 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 239210 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 60867 | |
| - | 47842 | |
| 1 | 38674 | |
| 2 | 35093 | |
| 9 | 15929 | 6.7% |
| 6 | 9098 | 3.8% |
| 3 | 7542 | 3.2% |
| 7 | 6383 | 2.7% |
| 8 | 6307 | 2.6% |
| 5 | 5981 | 2.5% |
home_team
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| Mexico | 316 |
|---|---|
| USA | 314 |
| Japan | 280 |
| Saudi Arabia | 272 |
| Korea Republic | 249 |
| Other values (206) |
Length
| Max length | 30 |
|---|---|
| Median length | 22 |
| Mean length | 8.101793403 |
| Min length | 3 |
Characters and Unicode
| Total characters | 193803 |
|---|---|
| Distinct characters | 60 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Bolivia |
|---|---|
| 2nd row | Brazil |
| 3rd row | Ecuador |
| 4th row | Guinea |
| 5th row | Paraguay |
Common Values
| Value | Count | Frequency (%) |
| Mexico | 316 | 1.3% |
| USA | 314 | 1.3% |
| Japan | 280 | 1.2% |
| Saudi Arabia | 272 | 1.1% |
| Korea Republic | 249 | 1.0% |
| Qatar | 249 | 1.0% |
| Oman | 241 | 1.0% |
| United Arab Emirates | 239 | 1.0% |
| Brazil | 233 | 1.0% |
| South Africa | 229 | 1.0% |
| Other values (201) | 21299 |
Words
| Value | Count | Frequency (%) |
| republic | 714 | 2.4% |
| and | 500 | 1.7% |
| korea | 323 | 1.1% |
| mexico | 316 | 1.1% |
| usa | 314 | 1.1% |
| ireland | 291 | 1.0% |
| japan | 280 | 0.9% |
| arabia | 272 | 0.9% |
| saudi | 272 | 0.9% |
| islands | 267 | 0.9% |
| Other values (236) | 25994 |
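The table of lowercased words above appears to be built by lowercasing each value and splitting it on whitespace: "Saudi Arabia" occurs 272 times and contributes 272 to both `saudi` and `arabia`. The sketch below reproduces that behaviour under this assumed tokenization, using only a small subset of the value counts, so totals for words shared with other team names (such as `korea` and `republic`) come out lower than in the full table.

```python
from collections import Counter

# Value counts copied from the Common Values table (subset only)
value_counts = {
    "Mexico": 316,
    "USA": 314,
    "Saudi Arabia": 272,
    "Korea Republic": 249,
}

word_counts = Counter()
for value, count in value_counts.items():
    # Assumed tokenization: lowercase, then split on whitespace
    for word in value.lower().split():
        word_counts[word] += count

print(word_counts.most_common())
```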
Most occurring characters
| Value | Count | Frequency (%) |
| a | 29866 | |
| i | 15905 | 8.2% |
| n | 14851 | 7.7% |
| e | 12723 | 6.6% |
| r | 11585 | 6.0% |
| o | 10169 | 5.2% |
| l | 7550 | 3.9% |
| u | 6692 | 3.5% |
| t | 6654 | 3.4% |
| d | 6305 | 3.3% |
| Other values (50) | 71503 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 157625 | |
| Uppercase Letter | 30186 | 15.6% |
| Space Separator | 5622 | 2.9% |
| Other Punctuation | 313 | 0.2% |
| Dash Punctuation | 57 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 29866 | |
| i | 15905 | |
| n | 14851 | |
| e | 12723 | 8.1% |
| r | 11585 | 7.3% |
| o | 10169 | 6.5% |
| l | 7550 | 4.8% |
| u | 6692 | 4.2% |
| t | 6654 | 4.2% |
| d | 6305 | 4.0% |
| Other values (21) | 35325 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3193 | 10.6% |
| A | 2539 | 8.4% |
| C | 2249 | 7.5% |
| M | 2076 | 6.9% |
| B | 2007 | 6.6% |
| I | 1977 | 6.5% |
| R | 1940 | 6.4% |
| E | 1466 | 4.9% |
| T | 1430 | 4.7% |
| G | 1420 | 4.7% |
| Other values (15) | 9889 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 159 | |
| ' | 154 |
Space Separator
| Value | Count | Frequency (%) |
| 5622 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 57 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 187811 | |
| Common | 5992 | 3.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 29866 | |
| i | 15905 | 8.5% |
| n | 14851 | 7.9% |
| e | 12723 | 6.8% |
| r | 11585 | 6.2% |
| o | 10169 | 5.4% |
| l | 7550 | 4.0% |
| u | 6692 | 3.6% |
| t | 6654 | 3.5% |
| d | 6305 | 3.4% |
| Other values (46) | 65511 |
Common
| Value | Count | Frequency (%) |
| 5622 | ||
| . | 159 | 2.7% |
| ' | 154 | 2.6% |
| - | 57 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 193565 | |
| None | 238 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 29866 | |
| i | 15905 | 8.2% |
| n | 14851 | 7.7% |
| e | 12723 | 6.6% |
| r | 11585 | 6.0% |
| o | 10169 | 5.3% |
| l | 7550 | 3.9% |
| u | 6692 | 3.5% |
| t | 6654 | 3.4% |
| d | 6305 | 3.3% |
| Other values (45) | 71265 |
None
| Value | Count | Frequency (%) |
| ô | 154 | |
| ç | 33 | 13.9% |
| ã | 17 | 7.1% |
| é | 17 | 7.1% |
| í | 17 | 7.1% |
away_team
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| Zambia | 243 |
|---|---|
| Costa Rica | 217 |
| Paraguay | 216 |
| Sweden | 206 |
| Mexico | 201 |
| Other values (206) |
Length
| Max length | 30 |
|---|---|
| Median length | 22 |
| Mean length | 8.126959575 |
| Min length | 3 |
Characters and Unicode
| Total characters | 194405 |
|---|---|
| Distinct characters | 60 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Uruguay |
|---|---|
| 2nd row | Mexico |
| 3rd row | Venezuela |
| 4th row | Sierra Leone |
| 5th row | Argentina |
Common Values
| Value | Count | Frequency (%) |
| Zambia | 243 | 1.0% |
| Costa Rica | 217 | 0.9% |
| Paraguay | 216 | 0.9% |
| Sweden | 206 | 0.9% |
| Mexico | 201 | 0.8% |
| Brazil | 200 | 0.8% |
| Jamaica | 199 | 0.8% |
| Saudi Arabia | 199 | 0.8% |
| Iraq | 199 | 0.8% |
| Ghana | 198 | 0.8% |
| Other values (201) | 21843 |
Words
| Value | Count | Frequency (%) |
| republic | 641 | 2.2% |
| and | 511 | 1.7% |
| korea | 331 | 1.1% |
| islands | 284 | 1.0% |
| ireland | 252 | 0.9% |
| zambia | 243 | 0.8% |
| guinea | 241 | 0.8% |
| congo | 229 | 0.8% |
| costa | 217 | 0.7% |
| rica | 217 | 0.7% |
| Other values (236) | 26234 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 30129 | |
| i | 16256 | 8.4% |
| n | 14765 | 7.6% |
| e | 13309 | 6.8% |
| r | 11382 | 5.9% |
| o | 10235 | 5.3% |
| l | 7532 | 3.9% |
| u | 7045 | 3.6% |
| t | 6392 | 3.3% |
| d | 6135 | 3.2% |
| Other values (50) | 71225 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 158602 | |
| Uppercase Letter | 29877 | 15.4% |
| Space Separator | 5479 | 2.8% |
| Other Punctuation | 354 | 0.2% |
| Dash Punctuation | 93 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 30129 | |
| i | 16256 | |
| n | 14765 | |
| e | 13309 | 8.4% |
| r | 11382 | 7.2% |
| o | 10235 | 6.5% |
| l | 7532 | 4.7% |
| u | 7045 | 4.4% |
| t | 6392 | 4.0% |
| d | 6135 | 3.9% |
| Other values (21) | 35422 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3129 | 10.5% |
| C | 2507 | 8.4% |
| A | 2064 | 6.9% |
| B | 2041 | 6.8% |
| M | 1979 | 6.6% |
| I | 1902 | 6.4% |
| R | 1881 | 6.3% |
| P | 1536 | 5.1% |
| G | 1534 | 5.1% |
| T | 1449 | 4.8% |
| Other values (15) | 9855 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 182 | |
| . | 172 |
Space Separator
| Value | Count | Frequency (%) |
| 5479 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 93 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 188479 | |
| Common | 5926 | 3.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 30129 | |
| i | 16256 | 8.6% |
| n | 14765 | 7.8% |
| e | 13309 | 7.1% |
| r | 11382 | 6.0% |
| o | 10235 | 5.4% |
| l | 7532 | 4.0% |
| u | 7045 | 3.7% |
| t | 6392 | 3.4% |
| d | 6135 | 3.3% |
| Other values (46) | 65299 |
Common
| Value | Count | Frequency (%) |
| 5479 | ||
| ' | 182 | 3.1% |
| . | 172 | 2.9% |
| - | 93 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 194113 | |
| None | 292 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 30129 | |
| i | 16256 | 8.4% |
| n | 14765 | 7.6% |
| e | 13309 | 6.9% |
| r | 11382 | 5.9% |
| o | 10235 | 5.3% |
| l | 7532 | 3.9% |
| u | 7045 | 3.6% |
| t | 6392 | 3.3% |
| d | 6135 | 3.2% |
| Other values (45) | 70933 |
None
| Value | Count | Frequency (%) |
| ô | 182 | |
| ç | 35 | 12.0% |
| ã | 25 | 8.6% |
| é | 25 | 8.6% |
| í | 25 | 8.6% |
home_team_continent
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| Europe | 7593 |
|---|---|
| Africa | 5885 |
| Asia | 5302 |
| North America | 2772 |
| South America | 1839 |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.92818026 |
| Min length | 4 |
Characters and Unicode
| Total characters | 165729 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | South America |
|---|---|
| 2nd row | South America |
| 3rd row | South America |
| 4th row | Africa |
| 5th row | South America |
Common Values
| Value | Count | Frequency (%) |
| Europe | 7593 | 31.7% |
| Africa | 5885 | 24.6% |
| Asia | 5302 | 22.2% |
| North America | 2772 | 11.6% |
| South America | 1839 | 7.7% |
| Oceania | 530 | 2.2% |
Words
| Value | Count | Frequency (%) |
| europe | 7593 | |
| africa | 5885 | |
| asia | 5302 | |
| america | 4611 | |
| north | 2772 | 9.7% |
| south | 1839 | 6.4% |
| oceania | 530 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 20861 | |
| a | 16858 | |
| i | 16328 | |
| A | 15798 | |
| e | 12734 | 7.7% |
| o | 12204 | 7.4% |
| c | 11026 | 6.7% |
| u | 9432 | 5.7% |
| E | 7593 | 4.6% |
| p | 7593 | 4.6% |
| Other values (10) | 35302 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 132586 | |
| Uppercase Letter | 28532 | 17.2% |
| Space Separator | 4611 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 20861 | |
| a | 16858 | |
| i | 16328 | |
| e | 12734 | |
| o | 12204 | |
| c | 11026 | |
| u | 9432 | |
| p | 7593 | 5.7% |
| f | 5885 | 4.4% |
| s | 5302 | 4.0% |
| Other values (4) | 14363 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 15798 | |
| E | 7593 | |
| N | 2772 | 9.7% |
| S | 1839 | 6.4% |
| O | 530 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 4611 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 161118 | |
| Common | 4611 | 2.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 20861 | |
| a | 16858 | |
| i | 16328 | |
| A | 15798 | |
| e | 12734 | |
| o | 12204 | 7.6% |
| c | 11026 | 6.8% |
| u | 9432 | 5.9% |
| E | 7593 | 4.7% |
| p | 7593 | 4.7% |
| Other values (9) | 30691 |
Common
| Value | Count | Frequency (%) |
| 4611 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 165729 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 20861 | |
| a | 16858 | |
| i | 16328 | |
| A | 15798 | |
| e | 12734 | 7.7% |
| o | 12204 | 7.4% |
| c | 11026 | 6.7% |
| u | 9432 | 5.7% |
| E | 7593 | 4.6% |
| p | 7593 | 4.6% |
| Other values (10) | 35302 |
away_team_continent
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| Europe | 7359 |
|---|---|
| Africa | 6306 |
| Asia | 4817 |
| North America | 2703 |
| South America | 2161 |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 7.044646963 |
| Min length | 4 |
Characters and Unicode
| Total characters | 168515 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | South America |
|---|---|
| 2nd row | North America |
| 3rd row | South America |
| 4th row | Africa |
| 5th row | South America |
Common Values
| Value | Count | Frequency (%) |
| Europe | 7359 | 30.8% |
| Africa | 6306 | 26.4% |
| Asia | 4817 | 20.1% |
| North America | 2703 | 11.3% |
| South America | 2161 | 9.0% |
| Oceania | 575 | 2.4% |
Words
| Value | Count | Frequency (%) |
| europe | 7359 | |
| africa | 6306 | |
| america | 4864 | |
| asia | 4817 | |
| north | 2703 | 9.4% |
| south | 2161 | 7.5% |
| oceania | 575 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 21232 | |
| a | 17137 | |
| i | 16562 | |
| A | 15987 | |
| e | 12798 | 7.6% |
| o | 12223 | 7.3% |
| c | 11745 | 7.0% |
| u | 9520 | 5.6% |
| E | 7359 | 4.4% |
| p | 7359 | 4.4% |
| Other values (10) | 36593 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 134866 | |
| Uppercase Letter | 28785 | 17.1% |
| Space Separator | 4864 | 2.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 21232 | |
| a | 17137 | |
| i | 16562 | |
| e | 12798 | |
| o | 12223 | |
| c | 11745 | |
| u | 9520 | |
| p | 7359 | 5.5% |
| f | 6306 | 4.7% |
| t | 4864 | 3.6% |
| Other values (4) | 15120 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 15987 | |
| E | 7359 | |
| N | 2703 | 9.4% |
| S | 2161 | 7.5% |
| O | 575 | 2.0% |
Space Separator
| Value | Count | Frequency (%) |
| 4864 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 163651 | |
| Common | 4864 | 2.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 21232 | |
| a | 17137 | |
| i | 16562 | |
| A | 15987 | |
| e | 12798 | |
| o | 12223 | 7.5% |
| c | 11745 | 7.2% |
| u | 9520 | 5.8% |
| E | 7359 | 4.5% |
| p | 7359 | 4.5% |
| Other values (9) | 31729 |
Common
| Value | Count | Frequency (%) |
| 4864 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 168515 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 21232 | |
| a | 17137 | |
| i | 16562 | |
| A | 15987 | |
| e | 12798 | 7.6% |
| o | 12223 | 7.3% |
| c | 11745 | 7.0% |
| u | 9520 | 5.6% |
| E | 7359 | 4.4% |
| p | 7359 | 4.4% |
| Other values (10) | 36593 |
home_team_fifa_rank
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77.85468835 |
| Minimum | 1 |
|---|---|
| Maximum | 211 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 33 |
| median | 71 |
| Q3 | 115 |
| 95-th percentile | 174 |
| Maximum | 211 |
| Range | 210 |
| Interquartile range (IQR) | 82 |
Descriptive statistics
| Standard deviation | 52.35522517 |
|---|---|
| Coefficient of variation (CV) | 0.6724736337 |
| Kurtosis | -0.7546146622 |
| Mean | 77.85468835 |
| Median Absolute Deviation (MAD) | 41 |
| Skewness | 0.4514227146 |
| Sum | 1862362 |
| Variance | 2741.069603 |
| Monotonicity | Not monotonic |
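Several of the figures in the quantile and descriptive tables are derived from one another, which gives a quick consistency check. The sketch below recomputes the mean, coefficient of variation, variance, interquartile range and range from the other reported values, using the standard definitions; the variable names are illustrative.

```python
# Figures copied from the tables above
n_observations = 23921
total = 1862362          # Sum
std = 52.35522517        # Standard deviation
q1, q3 = 33, 115
minimum, maximum = 1, 211

mean = total / n_observations   # Mean = Sum / number of observations
cv = std / mean                 # Coefficient of variation
variance = std ** 2             # Variance is the squared standard deviation
iqr = q3 - q1                   # Interquartile range
value_range = maximum - minimum

print(mean, cv, variance, iqr, value_range)
```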
Common Values
| Value | Count | Frequency (%) |
| 22 | 214 | 0.9% |
| 3 | 209 | 0.9% |
| 29 | 203 | 0.8% |
| 5 | 198 | 0.8% |
| 11 | 198 | 0.8% |
| 10 | 198 | 0.8% |
| 1 | 198 | 0.8% |
| 4 | 197 | 0.8% |
| 34 | 197 | 0.8% |
| 12 | 197 | 0.8% |
| Other values (201) | 21912 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1 | 198 | 0.8% |
| 2 | 187 | 0.8% |
| 3 | 209 | 0.9% |
| 4 | 197 | 0.8% |
| 5 | 198 | 0.8% |
| 6 | 183 | 0.8% |
| 7 | 178 | 0.7% |
| 8 | 196 | 0.8% |
| 9 | 177 | 0.7% |
| 10 | 198 | 0.8% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 211 | 6 | < 0.1% |
| 210 | 12 | 0.1% |
| 209 | 11 | < 0.1% |
| 208 | 8 | < 0.1% |
| 207 | 11 | < 0.1% |
| 206 | 15 | 0.1% |
| 205 | 14 | 0.1% |
| 204 | 21 | 0.1% |
| 203 | 43 | 0.2% |
| 202 | 23 | 0.1% |
away_team_fifa_rank
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 80.79737469 |
| Minimum | 1 |
|---|---|
| Maximum | 211 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 36 |
| median | 73 |
| Q3 | 119 |
| 95-th percentile | 179 |
| Maximum | 211 |
| Range | 210 |
| Interquartile range (IQR) | 83 |
Descriptive statistics
| Standard deviation | 53.23290188 |
|---|---|
| Coefficient of variation (CV) | 0.6588444499 |
| Kurtosis | -0.7663150753 |
| Mean | 80.79737469 |
| Median Absolute Deviation (MAD) | 41 |
| Skewness | 0.4438615419 |
| Sum | 1932754 |
| Variance | 2833.741843 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 1 | 195 | 0.8% |
| 29 | 189 | 0.8% |
| 14 | 188 | 0.8% |
| 38 | 184 | 0.8% |
| 18 | 183 | 0.8% |
| 4 | 182 | 0.8% |
| 55 | 182 | 0.8% |
| 36 | 182 | 0.8% |
| 37 | 182 | 0.8% |
| 27 | 180 | 0.8% |
| Other values (201) | 22074 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1 | 195 | 0.8% |
| 2 | 180 | 0.8% |
| 3 | 151 | 0.6% |
| 4 | 182 | 0.8% |
| 5 | 174 | 0.7% |
| 6 | 168 | 0.7% |
| 7 | 175 | 0.7% |
| 8 | 157 | 0.7% |
| 9 | 136 | 0.6% |
| 10 | 157 | 0.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 211 | 5 | < 0.1% |
| 210 | 13 | 0.1% |
| 209 | 12 | 0.1% |
| 208 | 8 | < 0.1% |
| 207 | 17 | 0.1% |
| 206 | 19 | 0.1% |
| 205 | 19 | 0.1% |
| 204 | 19 | 0.1% |
| 203 | 39 | 0.2% |
| 202 | 48 | 0.2% |
home_team_total_fifa_points
| Distinct | 1686 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 323.4014882 |
| Minimum | 0 |
|---|---|
| Maximum | 2164 |
| Zeros | 14290 |
| Zeros (%) | 59.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 547 |
| 95-th percentile | 1439 |
| Maximum | 2164 |
| Range | 2164 |
| Interquartile range (IQR) | 547 |
Descriptive statistics
| Standard deviation | 500.8257245 |
|---|---|
| Coefficient of variation (CV) | 1.548619109 |
| Kurtosis | 0.4078698977 |
| Mean | 323.4014882 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.348462631 |
| Sum | 7736087 |
| Variance | 250826.4064 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 14290 | 59.7% |
| 260 | 27 | 0.1% |
| 1174 | 22 | 0.1% |
| 369 | 20 | 0.1% |
| 340 | 19 | 0.1% |
| 924 | 19 | 0.1% |
| 228 | 19 | 0.1% |
| 323 | 19 | 0.1% |
| 389 | 18 | 0.1% |
| 374 | 18 | 0.1% |
| Other values (1676) | 9450 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 14290 | 59.7% |
| 1 | 2 | < 0.1% |
| 2 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 6 | < 0.1% |
| 5 | 7 | < 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 2 | < 0.1% |
| 8 | 4 | < 0.1% |
| 9 | 2 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2164 | 2 | < 0.1% |
| 2160 | 2 | < 0.1% |
| 2124 | 2 | < 0.1% |
| 2036 | 2 | < 0.1% |
| 2017 | 1 | < 0.1% |
| 1998 | 1 | < 0.1% |
| 1955 | 2 | < 0.1% |
| 1832 | 2 | < 0.1% |
| 1828 | 1 | < 0.1% |
| 1827 | 2 | < 0.1% |
away_team_total_fifa_points
| Distinct | 1679 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 315.4535764 |
| Minimum | 0 |
|---|---|
| Maximum | 2164 |
| Zeros | 14288 |
| Zeros (%) | 59.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 523 |
| 95-th percentile | 1416 |
| Maximum | 2164 |
| Range | 2164 |
| Interquartile range (IQR) | 523 |
Descriptive statistics
| Standard deviation | 490.9442731 |
|---|---|
| Coefficient of variation (CV) | 1.556312275 |
| Kurtosis | 0.4746270007 |
| Mean | 315.4535764 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.366652059 |
| Sum | 7545965 |
| Variance | 241026.2793 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 14288 | 59.7% |
| 551 | 18 | 0.1% |
| 374 | 18 | 0.1% |
| 298 | 18 | 0.1% |
| 316 | 17 | 0.1% |
| 255 | 17 | 0.1% |
| 332 | 17 | 0.1% |
| 329 | 17 | 0.1% |
| 1378 | 17 | 0.1% |
| 338 | 17 | 0.1% |
| Other values (1669) | 9477 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 14288 | 59.7% |
| 1 | 4 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 11 | < 0.1% |
| 5 | 12 | 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2164 | 1 | < 0.1% |
| 2124 | 2 | < 0.1% |
| 2104 | 1 | < 0.1% |
| 2099 | 4 | < 0.1% |
| 2087 | 1 | < 0.1% |
| 2041 | 1 | < 0.1% |
| 2036 | 2 | < 0.1% |
| 2014 | 1 | < 0.1% |
| 1832 | 4 | < 0.1% |
| 1828 | 1 | < 0.1% |
home_team_score
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.609213662 |
| Minimum | 0 |
|---|---|
| Maximum | 31 |
| Zeros | 6273 |
| Zeros (%) | 26.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 31 |
| Range | 31 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.630126712 |
|---|---|
| Coefficient of variation (CV) | 1.01299582 |
| Kurtosis | 12.8072405 |
| Mean | 1.609213662 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.210624947 |
| Sum | 38494 |
| Variance | 2.657313098 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 1 | 7229 | 30.2% |
| 0 | 6273 | 26.2% |
| 2 | 5263 | 22.0% |
| 3 | 2613 | 10.9% |
| 4 | 1330 | 5.6% |
| 5 | 583 | 2.4% |
| 6 | 292 | 1.2% |
| 7 | 142 | 0.6% |
| 8 | 84 | 0.4% |
| 9 | 43 | 0.2% |
| Other values (11) | 69 | 0.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 6273 | 26.2% |
| 1 | 7229 | 30.2% |
| 2 | 5263 | 22.0% |
| 3 | 2613 | 10.9% |
| 4 | 1330 | 5.6% |
| 5 | 583 | 2.4% |
| 6 | 292 | 1.2% |
| 7 | 142 | 0.6% |
| 8 | 84 | 0.4% |
| 9 | 43 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 31 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 19 | 2 | < 0.1% |
| 17 | 2 | < 0.1% |
| 16 | 3 | < 0.1% |
| 15 | 3 | < 0.1% |
| 14 | 6 | < 0.1% |
| 13 | 6 | < 0.1% |
| 12 | 9 | < 0.1% |
| 11 | 12 | 0.1% |
away_team_score
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.068266377 |
| Minimum | 0 |
|---|---|
| Maximum | 21 |
| Zeros | 9558 |
| Zeros (%) | 40.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.263944313 |
|---|---|
| Coefficient of variation (CV) | 1.183173355 |
| Kurtosis | 10.92452874 |
| Mean | 1.068266377 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.187963747 |
| Sum | 25554 |
| Variance | 1.597555226 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 0 | 9558 | 40.0% |
| 1 | 7759 | 32.4% |
| 2 | 4013 | 16.8% |
| 3 | 1551 | 6.5% |
| 4 | 594 | 2.5% |
| 5 | 214 | 0.9% |
| 6 | 107 | 0.4% |
| 7 | 67 | 0.3% |
| 8 | 27 | 0.1% |
| 10 | 11 | < 0.1% |
| Other values (8) | 20 | 0.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 9558 | 40.0% |
| 1 | 7759 | 32.4% |
| 2 | 4013 | 16.8% |
| 3 | 1551 | 6.5% |
| 4 | 594 | 2.5% |
| 5 | 214 | 0.9% |
| 6 | 107 | 0.4% |
| 7 | 67 | 0.3% |
| 8 | 27 | 0.1% |
| 9 | 9 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 11 | 3 | < 0.1% |
| 10 | 11 | < 0.1% |
| 9 | 9 | < 0.1% |
| 8 | 27 | 0.1% |
tournament
| Distinct | 82 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| Friendly | 8558 |
|---|---|
| FIFA World Cup qualification | 5528 |
| UEFA Euro qualification | 1723 |
| African Cup of Nations qualification | 1274 |
| AFC Asian Cup qualification | 541 |
| Other values (77) |
Length
| Max length | 42 |
|---|---|
| Median length | 37 |
| Mean length | 17.90719452 |
| Min length | 7 |
Characters and Unicode
| Total characters | 428358 |
|---|---|
| Distinct characters | 54 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | FIFA World Cup qualification |
|---|---|
| 2nd row | Friendly |
| 3rd row | FIFA World Cup qualification |
| 4th row | Friendly |
| 5th row | FIFA World Cup qualification |
Common Values
| Value | Count | Frequency (%) |
| Friendly | 8558 | 35.8% |
| FIFA World Cup qualification | 5528 | 23.1% |
| UEFA Euro qualification | 1723 | 7.2% |
| African Cup of Nations qualification | 1274 | 5.3% |
| AFC Asian Cup qualification | 541 | 2.3% |
| African Cup of Nations | 490 | 2.0% |
| FIFA World Cup | 432 | 1.8% |
| UEFA Nations League | 415 | 1.7% |
| COSAFA Cup | 309 | 1.3% |
| CECAFA Cup | 308 | 1.3% |
| Other values (72) | 4343 |
Words
| Value | Count | Frequency (%) |
| cup | 11350 | |
| qualification | 9647 | |
| friendly | 8558 | |
| world | 5960 | |
| fifa | 5960 | |
| nations | 2763 | 4.5% |
| uefa | 2391 | 3.9% |
| african | 2053 | 3.4% |
| euro | 1976 | 3.2% |
| of | 1767 | 2.9% |
| Other values (89) | 8833 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 46515 | 10.9% |
| (space) | 37337 | 8.7% |
| a | 29604 | 6.9% |
| n | 26838 | 6.3% |
| F | 26533 | 6.2% |
| l | 25607 | 6.0% |
| o | 24292 | 5.7% |
| u | 24223 | 5.7% |
| r | 20428 | 4.8% |
| C | 16274 | 3.8% |
| Other values (44) | 150707 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 306531 | |
| Uppercase Letter | 84387 | 19.7% |
| Space Separator | 37337 | 8.7% |
| Other Punctuation | 98 | < 0.1% |
| Dash Punctuation | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 46515 | |
| a | 29604 | |
| n | 26838 | |
| l | 25607 | 8.4% |
| o | 24292 | 7.9% |
| u | 24223 | 7.9% |
| r | 20428 | 6.7% |
| d | 15188 | 5.0% |
| f | 13897 | 4.5% |
| p | 13355 | 4.4% |
| Other values (18) | 66584 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 26533 | |
| C | 16274 | |
| A | 15106 | |
| I | 6145 | 7.3% |
| W | 6081 | 7.2% |
| E | 4815 | 5.7% |
| N | 3205 | 3.8% |
| U | 2972 | 3.5% |
| O | 630 | 0.7% |
| G | 617 | 0.7% |
| Other values (12) | 2009 | 2.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 | |
| – | 1 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| 37337 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 98 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 390918 | |
| Common | 37440 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 46515 | 11.9% |
| a | 29604 | 7.6% |
| n | 26838 | 6.9% |
| F | 26533 | 6.8% |
| l | 25607 | 6.6% |
| o | 24292 | 6.2% |
| u | 24223 | 6.2% |
| r | 20428 | 5.2% |
| C | 16274 | 4.2% |
| d | 15188 | 3.9% |
| Other values (40) | 135416 |
Common
| Value | Count | Frequency (%) |
| (space) | 37337 | |
| ' | 98 | 0.3% |
| - | 4 | < 0.1% |
| – | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 427968 | |
| None | 389 | 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 46515 | 10.9% |
| (space) | 37337 | 8.7% |
| a | 29604 | 6.9% |
| n | 26838 | 6.3% |
| F | 26533 | 6.2% |
| l | 25607 | 6.0% |
| o | 24292 | 5.7% |
| u | 24223 | 5.7% |
| r | 20428 | 4.8% |
| C | 16274 | 3.8% |
| Other values (40) | 150317 |
None
| Value | Count | Frequency (%) |
| é | 304 | |
| í | 77 | 19.8% |
| á | 8 | 2.1% |
Punctuation
| Value | Count | Frequency (%) |
| – | 1 |
| Distinct | 1576 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| Doha | 397 |
|---|---|
| Bangkok | 215 |
| Muscat | 212 |
| Kuwait City | 202 |
| Abu Dhabi | 191 |
| Other values (1571) |
Length
| Max length | 28 |
|---|---|
| Median length | 24 |
| Mean length | 7.723840977 |
| Min length | 2 |
Characters and Unicode
| Total characters | 184762 |
|---|---|
| Distinct characters | 122 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 5 |
Unique
| Unique | 474 |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | La Paz |
|---|---|
| 2nd row | Maceió |
| 3rd row | Quito |
| 4th row | Conakry |
| 5th row | Asunción |
Common Values
| Value | Count | Frequency (%) |
| Doha | 397 | 1.7% |
| Bangkok | 215 | 0.9% |
| Muscat | 212 | 0.9% |
| Kuwait City | 202 | 0.8% |
| Abu Dhabi | 191 | 0.8% |
| London | 185 | 0.8% |
| Amman | 180 | 0.8% |
| Cairo | 164 | 0.7% |
| Dubai | 162 | 0.7% |
| Tehran | 161 | 0.7% |
| Other values (1566) | 21852 |
Length
| Value | Count | Frequency (%) |
| city | 527 | 1.9% |
| san | 435 | 1.5% |
| doha | 397 | 1.4% |
| port | 241 | 0.8% |
| bangkok | 215 | 0.8% |
| muscat | 212 | 0.7% |
| kuwait | 203 | 0.7% |
| abu | 191 | 0.7% |
| dhabi | 191 | 0.7% |
| london | 187 | 0.7% |
| Other values (1715) | 25660 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 25514 | 13.8% |
| n | 12597 | 6.8% |
| o | 12492 | 6.8% |
| i | 12241 | 6.6% |
| e | 11368 | 6.2% |
| r | 10010 | 5.4% |
| u | 7653 | 4.1% |
| l | 7455 | 4.0% |
| t | 7391 | 4.0% |
| s | 7128 | 3.9% |
| Other values (112) | 70913 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 150864 | |
| Uppercase Letter | 28402 | 15.4% |
| Space Separator | 4538 | 2.5% |
| Dash Punctuation | 530 | 0.3% |
| Other Punctuation | 417 | 0.2% |
| Initial Punctuation | 6 | < 0.1% |
| Decimal Number | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 25514 | |
| n | 12597 | 8.3% |
| o | 12492 | 8.3% |
| i | 12241 | 8.1% |
| e | 11368 | 7.5% |
| r | 10010 | 6.6% |
| u | 7653 | 5.1% |
| l | 7455 | 4.9% |
| t | 7391 | 4.9% |
| s | 7128 | 4.7% |
| Other values (66) | 37015 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3091 | |
| B | 2573 | 9.1% |
| A | 2400 | 8.5% |
| C | 2114 | 7.4% |
| M | 2033 | 7.2% |
| D | 1937 | 6.8% |
| L | 1929 | 6.8% |
| P | 1894 | 6.7% |
| K | 1600 | 5.6% |
| T | 1470 | 5.2% |
| Other values (29) | 7361 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 251 | |
| . | 162 | |
| / | 4 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4538 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 530 |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 6 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 179266 | |
| Common | 5496 | 3.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 25514 | 14.2% |
| n | 12597 | 7.0% |
| o | 12492 | 7.0% |
| i | 12241 | 6.8% |
| e | 11368 | 6.3% |
| r | 10010 | 5.6% |
| u | 7653 | 4.3% |
| l | 7455 | 4.2% |
| t | 7391 | 4.1% |
| s | 7128 | 4.0% |
| Other values (105) | 65417 |
Common
| Value | Count | Frequency (%) |
| (space) | 4538 | |
| - | 530 | 9.6% |
| ' | 251 | 4.6% |
| . | 162 | 2.9% |
| ‘ | 6 | 0.1% |
| 6 | 5 | 0.1% |
| / | 4 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 182789 | |
| None | 1954 | 1.1% |
| Latin Ext Additional | 7 | < 0.1% |
| IPA Ext | 6 | < 0.1% |
| Punctuation | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 25514 | 14.0% |
| n | 12597 | 6.9% |
| o | 12492 | 6.8% |
| i | 12241 | 6.7% |
| e | 11368 | 6.2% |
| r | 10010 | 5.5% |
| u | 7653 | 4.2% |
| l | 7455 | 4.1% |
| t | 7391 | 4.0% |
| s | 7128 | 3.9% |
| Other values (48) | 68940 |
None
| Value | Count | Frequency (%) |
| é | 522 | |
| ó | 297 | |
| í | 185 | 9.5% |
| è | 104 | 5.3% |
| ș | 97 | 5.0% |
| ă | 85 | 4.4% |
| á | 72 | 3.7% |
| ã | 63 | 3.2% |
| à | 59 | 3.0% |
| ò | 57 | 2.9% |
| Other values (47) | 413 |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 6 |
Punctuation
| Value | Count | Frequency (%) |
| ‘ | 6 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ị | 2 | |
| ả | 2 | |
| ộ | 1 | |
| ầ | 1 | |
| ủ | 1 |
| Distinct | 217 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| USA | 1003 |
|---|---|
| South Africa | 505 |
| United Arab Emirates | 462 |
| Qatar | 461 |
| France | 445 |
| Other values (212) |
Length
| Max length | 30 |
|---|---|
| Median length | 22 |
| Mean length | 8.085238911 |
| Min length | 3 |
Characters and Unicode
| Total characters | 193407 |
|---|---|
| Distinct characters | 60 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Bolivia |
|---|---|
| 2nd row | Brazil |
| 3rd row | Ecuador |
| 4th row | Guinea |
| 5th row | Paraguay |
Common Values
| Value | Count | Frequency (%) |
| USA | 1003 | 4.2% |
| South Africa | 505 | 2.1% |
| United Arab Emirates | 462 | 1.9% |
| Qatar | 461 | 1.9% |
| France | 445 | 1.9% |
| Germany | 285 | 1.2% |
| Saudi Arabia | 280 | 1.2% |
| Thailand | 280 | 1.2% |
| Japan | 277 | 1.2% |
| England | 277 | 1.2% |
| Other values (207) | 19646 |
Length
| Value | Count | Frequency (%) |
| usa | 1003 | 3.4% |
| republic | 658 | 2.2% |
| south | 513 | 1.7% |
| and | 509 | 1.7% |
| africa | 505 | 1.7% |
| united | 462 | 1.5% |
| arab | 462 | 1.5% |
| emirates | 462 | 1.5% |
| qatar | 461 | 1.5% |
| france | 445 | 1.5% |
| Other values (244) | 24377 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 29504 | |
| i | 15481 | 8.0% |
| n | 14874 | 7.7% |
| e | 12133 | 6.3% |
| r | 11936 | 6.2% |
| o | 9474 | 4.9% |
| t | 7232 | 3.7% |
| l | 7079 | 3.7% |
| u | 6744 | 3.5% |
| d | 6303 | 3.3% |
| Other values (50) | 72647 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 155498 | |
| Uppercase Letter | 31703 | 16.4% |
| Space Separator | 5936 | 3.1% |
| Other Punctuation | 243 | 0.1% |
| Dash Punctuation | 27 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 29504 | |
| i | 15481 | |
| n | 14874 | |
| e | 12133 | 7.8% |
| r | 11936 | 7.7% |
| o | 9474 | 6.1% |
| t | 7232 | 4.7% |
| l | 7079 | 4.6% |
| u | 6744 | 4.3% |
| d | 6303 | 4.1% |
| Other values (21) | 34738 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3941 | |
| A | 3577 | 11.3% |
| U | 2006 | 6.3% |
| C | 1924 | 6.1% |
| M | 1879 | 5.9% |
| R | 1800 | 5.7% |
| B | 1795 | 5.7% |
| E | 1723 | 5.4% |
| I | 1701 | 5.4% |
| T | 1556 | 4.9% |
| Other values (15) | 9801 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 151 | |
| ' | 92 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5936 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 27 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 187201 | |
| Common | 6206 | 3.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 29504 | |
| i | 15481 | 8.3% |
| n | 14874 | 7.9% |
| e | 12133 | 6.5% |
| r | 11936 | 6.4% |
| o | 9474 | 5.1% |
| t | 7232 | 3.9% |
| l | 7079 | 3.8% |
| u | 6744 | 3.6% |
| d | 6303 | 3.4% |
| Other values (46) | 66441 |
Common
| Value | Count | Frequency (%) |
| (space) | 5936 | |
| . | 151 | 2.4% |
| ' | 92 | 1.5% |
| - | 27 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 193237 | |
| None | 170 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 29504 | |
| i | 15481 | 8.0% |
| n | 14874 | 7.7% |
| e | 12133 | 6.3% |
| r | 11936 | 6.2% |
| o | 9474 | 4.9% |
| t | 7232 | 3.7% |
| l | 7079 | 3.7% |
| u | 6744 | 3.5% |
| d | 6303 | 3.3% |
| Other values (45) | 72477 |
None
| Value | Count | Frequency (%) |
| ô | 92 | |
| ç | 31 | 18.2% |
| é | 17 | 10.0% |
| ã | 15 | 8.8% |
| í | 15 | 8.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 23.5 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 17947 | |
| True | 5974 | 25.0% |
shoot_out
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 23.5 KiB |
| False | |
|---|---|
| True | 332 |
| Value | Count | Frequency (%) |
| False | 23589 | |
| True | 332 | 1.4% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.0 KiB |
| Win | |
|---|---|
| Lose | |
| Draw |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.508339952 |
| Min length | 3 |
Characters and Unicode
| Total characters | 83923 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Win |
|---|---|
| 2nd row | Draw |
| 3rd row | Win |
| 4th row | Win |
| 5th row | Lose |
Common Values
| Value | Count | Frequency (%) |
| Win | 11761 | |
| Lose | 6771 | |
| Draw | 5389 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| win | 11761 | |
| lose | 6771 | |
| draw | 5389 |
Most occurring characters
| Value | Count | Frequency (%) |
| W | 11761 | |
| i | 11761 | |
| n | 11761 | |
| L | 6771 | |
| o | 6771 | |
| s | 6771 | |
| e | 6771 | |
| D | 5389 | |
| r | 5389 | |
| a | 5389 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 60002 | |
| Uppercase Letter | 23921 | 28.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 11761 | |
| n | 11761 | |
| o | 6771 | |
| s | 6771 | |
| e | 6771 | |
| r | 5389 | |
| a | 5389 | |
| w | 5389 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 11761 | |
| L | 6771 | |
| D | 5389 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 83923 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| W | 11761 | |
| i | 11761 | |
| n | 11761 | |
| L | 6771 | |
| o | 6771 | |
| s | 6771 | |
| e | 6771 | |
| D | 5389 | |
| r | 5389 | |
| a | 5389 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 83923 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| W | 11761 | |
| i | 11761 | |
| n | 11761 | |
| L | 6771 | |
| o | 6771 | |
| s | 6771 | |
| e | 6771 | |
| D | 5389 | |
| r | 5389 | |
| a | 5389 |
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 15542 |
| Missing (%) | 65.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.96383817 |
| Minimum | 47 |
|---|---|
| Maximum | 97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 47 |
|---|---|
| 5-th percentile | 61 |
| Q1 | 70 |
| median | 75 |
| Q3 | 81 |
| 95-th percentile | 88 |
| Maximum | 97 |
| Range | 50 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 8.212242192 |
|---|---|
| Coefficient of variation (CV) | 0.1095493826 |
| Kurtosis | 0.1345376971 |
| Mean | 74.96383817 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.2847013488 |
| Sum | 628122 |
| Variance | 67.44092181 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 73 | 474 | 2.0% |
| 74 | 441 | 1.8% |
| 75 | 437 | 1.8% |
| 76 | 420 | 1.8% |
| 72 | 403 | 1.7% |
| 77 | 374 | 1.6% |
| 79 | 347 | 1.5% |
| 81 | 344 | 1.4% |
| 80 | 325 | 1.4% |
| 82 | 324 | 1.4% |
| Other values (40) | 4490 | 18.8% |
| (Missing) | 15542 |
| Value | Count | Frequency (%) |
| 47 | 6 | < 0.1% |
| 48 | 7 | < 0.1% |
| 49 | 9 | < 0.1% |
| 50 | 12 | 0.1% |
| 51 | 17 | |
| 52 | 30 | |
| 53 | 24 | |
| 54 | 20 | |
| 55 | 42 | |
| 56 | 32 |
| Value | Count | Frequency (%) |
| 97 | 6 | < 0.1% |
| 95 | 19 | 0.1% |
| 94 | 26 | 0.1% |
| 93 | 31 | 0.1% |
| 92 | 22 | 0.1% |
| 91 | 51 | 0.2% |
| 90 | 113 | |
| 89 | 120 | |
| 88 | 111 | |
| 87 | 145 |
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 15826 |
| Missing (%) | 66.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.21247684 |
| Minimum | 47 |
|---|---|
| Maximum | 97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 47 |
|---|---|
| 5-th percentile | 61 |
| Q1 | 69 |
| median | 74 |
| Q3 | 80 |
| 95-th percentile | 87 |
| Maximum | 97 |
| Range | 50 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 8.225919096 |
|---|---|
| Coefficient of variation (CV) | 0.110842805 |
| Kurtosis | 0.1394057478 |
| Mean | 74.21247684 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.2838054963 |
| Sum | 600750 |
| Variance | 67.66574498 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 75 | 473 | 2.0% |
| 73 | 466 | 1.9% |
| 72 | 424 | 1.8% |
| 74 | 404 | 1.7% |
| 76 | 365 | 1.5% |
| 77 | 343 | 1.4% |
| 69 | 340 | 1.4% |
| 81 | 313 | 1.3% |
| 78 | 311 | 1.3% |
| 79 | 310 | 1.3% |
| Other values (40) | 4346 | 18.2% |
| (Missing) | 15826 |
| Value | Count | Frequency (%) |
| 47 | 6 | < 0.1% |
| 48 | 10 | < 0.1% |
| 49 | 10 | < 0.1% |
| 50 | 21 | |
| 51 | 21 | |
| 52 | 30 | |
| 53 | 23 | |
| 54 | 24 | |
| 55 | 47 | |
| 56 | 39 |
| Value | Count | Frequency (%) |
| 97 | 5 | < 0.1% |
| 95 | 12 | 0.1% |
| 94 | 14 | 0.1% |
| 93 | 24 | 0.1% |
| 92 | 16 | 0.1% |
| 91 | 43 | 0.2% |
| 90 | 87 | |
| 89 | 91 | |
| 88 | 97 | |
| 87 | 122 |
| Distinct | 127 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 16134 |
| Missing (%) | 67.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.903249 |
| Minimum | 52.8 |
|---|---|
| Maximum | 91.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 52.8 |
|---|---|
| 5-th percentile | 65 |
| Q1 | 71 |
| median | 75.2 |
| Q3 | 78.8 |
| 95-th percentile | 85 |
| Maximum | 91.8 |
| Range | 39 |
| Interquartile range (IQR) | 7.8 |
Descriptive statistics
| Standard deviation | 6.003114482 |
|---|---|
| Coefficient of variation (CV) | 0.08014491443 |
| Kurtosis | 0.02149570954 |
| Mean | 74.903249 |
| Median Absolute Deviation (MAD) | 3.8 |
| Skewness | -0.1065522626 |
| Sum | 583271.6 |
| Variance | 36.03738348 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 75.5 | 194 | 0.8% |
| 77 | 182 | 0.8% |
| 76 | 178 | 0.7% |
| 76.5 | 177 | 0.7% |
| 75.2 | 159 | 0.7% |
| 78.2 | 150 | 0.6% |
| 71.5 | 143 | 0.6% |
| 77.8 | 142 | 0.6% |
| 70.8 | 141 | 0.6% |
| 74.2 | 140 | 0.6% |
| Other values (117) | 6181 | 25.8% |
| (Missing) | 16134 |
| Value | Count | Frequency (%) |
| 52.8 | 6 | < 0.1% |
| 56.5 | 11 | |
| 57.5 | 1 | < 0.1% |
| 57.8 | 8 | |
| 58.2 | 3 | < 0.1% |
| 58.5 | 16 | |
| 58.8 | 3 | < 0.1% |
| 59 | 8 | |
| 59.2 | 4 | < 0.1% |
| 59.5 | 10 |
| Value | Count | Frequency (%) |
| 91.8 | 6 | < 0.1% |
| 90.5 | 4 | < 0.1% |
| 90.2 | 10 | < 0.1% |
| 89.5 | 11 | < 0.1% |
| 89 | 7 | < 0.1% |
| 88 | 21 | |
| 87.8 | 15 | 0.1% |
| 87.5 | 43 | |
| 87.2 | 10 | < 0.1% |
| 87 | 22 |
| Distinct | 103 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 15411 |
| Missing (%) | 64.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75.81874266 |
| Minimum | 53.3 |
|---|---|
| Maximum | 93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 53.3 |
|---|---|
| 5-th percentile | 66 |
| Q1 | 71.7 |
| median | 75.7 |
| Q3 | 80 |
| 95-th percentile | 86.7 |
| Maximum | 93 |
| Range | 39.7 |
| Interquartile range (IQR) | 8.3 |
Descriptive statistics
| Standard deviation | 6.26841591 |
|---|---|
| Coefficient of variation (CV) | 0.08267633689 |
| Kurtosis | -0.05510707391 |
| Mean | 75.81874266 |
| Median Absolute Deviation (MAD) | 4.3 |
| Skewness | 0.01423465344 |
| Sum | 645217.5 |
| Variance | 39.29303802 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 76.3 | 229 | 1.0% |
| 77.7 | 223 | 0.9% |
| 72.7 | 218 | 0.9% |
| 76.7 | 218 | 0.9% |
| 71.3 | 218 | 0.9% |
| 72.3 | 215 | 0.9% |
| 74.7 | 197 | 0.8% |
| 73.3 | 194 | 0.8% |
| 73.7 | 184 | 0.8% |
| 75.7 | 174 | 0.7% |
| Other values (93) | 6440 | |
| (Missing) | 15411 |
| Value | Count | Frequency (%) |
| 53.3 | 4 | < 0.1% |
| 55 | 3 | < 0.1% |
| 57.7 | 6 | < 0.1% |
| 58 | 7 | < 0.1% |
| 58.3 | 4 | < 0.1% |
| 59 | 12 | |
| 59.3 | 3 | < 0.1% |
| 59.7 | 23 | |
| 60 | 14 | |
| 60.3 | 18 |
| Value | Count | Frequency (%) |
| 93 | 13 | 0.1% |
| 92.7 | 7 | < 0.1% |
| 92.3 | 13 | 0.1% |
| 91 | 19 | |
| 90.7 | 6 | < 0.1% |
| 90.3 | 13 | 0.1% |
| 90 | 25 | |
| 89.3 | 34 | |
| 89 | 12 | 0.1% |
| 88.7 | 37 |
| Distinct | 134 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 15759 |
| Missing (%) | 65.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75.88929184 |
| Minimum | 54.2 |
|---|---|
| Maximum | 93.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 54.2 |
|---|---|
| 5-th percentile | 65 |
| Q1 | 72.5 |
| median | 76.2 |
| Q3 | 79.5 |
| 95-th percentile | 86 |
| Maximum | 93.2 |
| Range | 39 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 6.053109555 |
|---|---|
| Coefficient of variation (CV) | 0.07976236711 |
| Kurtosis | 0.2554212993 |
| Mean | 75.88929184 |
| Median Absolute Deviation (MAD) | 3.4 |
| Skewness | -0.2912718461 |
| Sum | 619408.4 |
| Variance | 36.64013529 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 76.2 | 243 | 1.0% |
| 76.8 | 207 | 0.9% |
| 75 | 192 | 0.8% |
| 74.8 | 189 | 0.8% |
| 78.2 | 188 | 0.8% |
| 78.5 | 182 | 0.8% |
| 75.2 | 169 | 0.7% |
| 78 | 164 | 0.7% |
| 79.2 | 158 | 0.7% |
| 77.2 | 157 | 0.7% |
| Other values (124) | 6313 | |
| (Missing) | 15759 |
| Value | Count | Frequency (%) |
| 54.2 | 2 | < 0.1% |
| 55.5 | 7 | |
| 56 | 1 | < 0.1% |
| 56.8 | 2 | < 0.1% |
| 57.2 | 7 | |
| 57.5 | 12 | |
| 57.8 | 8 | |
| 58 | 8 | |
| 58.2 | 3 | < 0.1% |
| 58.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 93.2 | 7 | < 0.1% |
| 92 | 7 | < 0.1% |
| 91.2 | 4 | < 0.1% |
| 89.8 | 7 | < 0.1% |
| 89.5 | 14 | |
| 89.2 | 9 | < 0.1% |
| 89 | 19 | |
| 88.8 | 4 | < 0.1% |
| 88.5 | 24 | |
| 88.2 | 17 |
| Distinct | 127 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 16357 |
| Missing (%) | 68.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.42437864 |
| Minimum | 52.8 |
|---|---|
| Maximum | 91.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 52.8 |
|---|---|
| 5-th percentile | 64.8 |
| Q1 | 70.5 |
| median | 74.5 |
| Q3 | 78.2 |
| 95-th percentile | 84.8 |
| Maximum | 91.8 |
| Range | 39 |
| Interquartile range (IQR) | 7.7 |
Descriptive statistics
| Standard deviation | 5.937425305 |
|---|---|
| Coefficient of variation (CV) | 0.07977796273 |
| Kurtosis | 0.02034776311 |
| Mean | 74.42437864 |
| Median Absolute Deviation (MAD) | 3.7 |
| Skewness | -0.04461795013 |
| Sum | 562946 |
| Variance | 35.25301925 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 75.5 | 175 | 0.7% |
| 77 | 167 | 0.7% |
| 76 | 162 | 0.7% |
| 74.5 | 157 | 0.7% |
| 75.2 | 157 | 0.7% |
| 76.5 | 152 | 0.6% |
| 70.8 | 151 | 0.6% |
| 74.2 | 148 | 0.6% |
| 71.5 | 138 | 0.6% |
| 78.2 | 136 | 0.6% |
| Other values (117) | 6021 | 25.2% |
| (Missing) | 16357 |
| Value | Count | Frequency (%) |
| 52.8 | 7 | |
| 56.5 | 8 | |
| 57.5 | 4 | < 0.1% |
| 57.8 | 5 | < 0.1% |
| 58.2 | 4 | < 0.1% |
| 58.5 | 12 | |
| 58.8 | 6 | |
| 59 | 10 | |
| 59.2 | 1 | < 0.1% |
| 59.5 | 13 |
| Value | Count | Frequency (%) |
| 91.8 | 5 | < 0.1% |
| 90.5 | 7 | < 0.1% |
| 90.2 | 7 | < 0.1% |
| 89.5 | 3 | < 0.1% |
| 89 | 10 | < 0.1% |
| 88 | 17 | |
| 87.8 | 16 | |
| 87.5 | 30 | |
| 87.2 | 10 | < 0.1% |
| 87 | 21 |
| Distinct | 103 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 15609 |
| Missing (%) | 65.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75.42001925 |
| Minimum | 53.3 |
|---|---|
| Maximum | 93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 53.3 |
|---|---|
| 5-th percentile | 65.7 |
| Q1 | 71.3 |
| median | 75.3 |
| Q3 | 79.7 |
| 95-th percentile | 86 |
| Maximum | 93 |
| Range | 39.7 |
| Interquartile range (IQR) | 8.4 |
Descriptive statistics
| Standard deviation | 6.201905739 |
|---|---|
| Coefficient of variation (CV) | 0.08223155869 |
| Kurtosis | -0.05994827244 |
| Mean | 75.42001925 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.006600849852 |
| Sum | 626891.2 |
| Variance | 38.4636348 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 72.3 | 238 | 1.0% |
| 71.3 | 234 | 1.0% |
| 72.7 | 213 | 0.9% |
| 76.7 | 206 | 0.9% |
| 76.3 | 203 | 0.8% |
| 77.7 | 200 | 0.8% |
| 73 | 192 | 0.8% |
| 70.7 | 184 | 0.8% |
| 75.3 | 180 | 0.8% |
| 74.7 | 178 | 0.7% |
| Other values (93) | 6284 | |
| (Missing) | 15609 |
| Value | Count | Frequency (%) |
| 53.3 | 4 | < 0.1% |
| 55 | 4 | < 0.1% |
| 57.7 | 7 | < 0.1% |
| 58 | 8 | < 0.1% |
| 58.3 | 3 | < 0.1% |
| 59 | 12 | 0.1% |
| 59.3 | 5 | < 0.1% |
| 59.7 | 30 | |
| 60 | 6 | < 0.1% |
| 60.3 | 20 |
| Value | Count | Frequency (%) |
| 93 | 8 | < 0.1% |
| 92.7 | 5 | < 0.1% |
| 92.3 | 15 | |
| 91 | 13 | |
| 90.7 | 6 | < 0.1% |
| 90.3 | 8 | < 0.1% |
| 90 | 13 | |
| 89.3 | 24 | |
| 89 | 15 | |
| 88.7 | 26 |
| Distinct | 134 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 15942 |
| Missing (%) | 66.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75.25914275 |
| Minimum | 54.2 |
|---|---|
| Maximum | 93.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 187.0 KiB |
Quantile statistics
| Minimum | 54.2 |
|---|---|
| 5-th percentile | 63.8 |
| Q1 | 71.8 |
| median | 75.5 |
| Q3 | 79 |
| 95-th percentile | 85.5 |
| Maximum | 93.2 |
| Range | 39 |
| Interquartile range (IQR) | 7.2 |
Descriptive statistics
| Standard deviation | 6.124573345 |
|---|---|
| Coefficient of variation (CV) | 0.0813797915 |
| Kurtosis | 0.188041849 |
| Mean | 75.25914275 |
| Median Absolute Deviation (MAD) | 3.7 |
| Skewness | -0.2751545753 |
| Sum | 600492.7 |
| Variance | 37.51039866 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 76.2 | 208 | 0.9% |
| 75 | 196 | 0.8% |
| 74.8 | 185 | 0.8% |
| 78 | 169 | 0.7% |
| 76.8 | 169 | 0.7% |
| 78.5 | 168 | 0.7% |
| 77.2 | 166 | 0.7% |
| 78.2 | 165 | 0.7% |
| 74.5 | 156 | 0.7% |
| 75.2 | 155 | 0.6% |
| Other values (124) | 6242 | 26.1% |
| (Missing) | 15942 |
| Value | Count | Frequency (%) |
| 54.2 | 10 | |
| 55.5 | 6 | |
| 56 | 2 | < 0.1% |
| 56.8 | 3 | < 0.1% |
| 57.2 | 8 | |
| 57.5 | 7 | |
| 57.8 | 6 | |
| 58 | 6 | |
| 58.2 | 6 | |
| 58.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 93.2 | 3 | < 0.1% |
| 92 | 5 | < 0.1% |
| 91.2 | 7 | < 0.1% |
| 89.8 | 10 | < 0.1% |
| 89.5 | 8 | < 0.1% |
| 89.2 | 5 | < 0.1% |
| 89 | 15 | |
| 88.8 | 2 | < 0.1% |
| 88.5 | 25 | |
| 88.2 | 8 | < 0.1% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables, so it captures nonlinear monotonic relationships better than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
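The calculation described above can be sketched in plain Python. This is a minimal illustration, not the code the report generator runs: the helper names (`average_ranks`, `pearson_on`, `spearman`) are hypothetical, ties receive the mean of their positions (a common convention, assumed here), and the sample values are the `home_team_fifa_rank` and `home_team_total_fifa_points` columns of the ten rows shown under "Last rows".

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current run of equal values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson_on(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    dev_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    dev_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    # The 1/n factors of covariance and standard deviations cancel out.
    return cov / (dev_x * dev_y)


def spearman(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson_on(average_ranks(x), average_ranks(y))


# Values copied from the ten "Last rows" of the report.
fifa_rank = [27, 59, 48, 94, 43, 180, 192, 28, 23, 29]
fifa_points = [1535, 1388, 1446, 1229, 1461, 932, 895, 1526, 1553, 1519]

rho = spearman(fifa_rank, fifa_points)
print(round(rho, 6))
```

In these ten rows a better (lower) rank always goes with more points, so the relationship is perfectly monotonic and decreasing, and ρ comes out at -1 up to floating-point rounding.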
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear relationship the steepness of the line does not change the magnitude of r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
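As a rough sketch of that definition, and of the invariance claim, the snippet below computes r by hand with the standard library only. The function name `pearson_r` and the particular shift and scale constants are illustrative assumptions; the data are again the `home_team_fifa_rank` and `home_team_total_fifa_points` values from the ten rows under "Last rows".

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)


# Values copied from the ten "Last rows" of the report.
fifa_rank = [27, 59, 48, 94, 43, 180, 192, 28, 23, 29]
fifa_points = [1535, 1388, 1446, 1229, 1461, 932, 895, 1526, 1553, 1519]

r = pearson_r(fifa_rank, fifa_points)

# Shift and rescale each variable separately (positive scale factors).
shifted_rank = [2 * v + 5 for v in fifa_rank]
shifted_points = [0.5 * v - 100 for v in fifa_points]
r_shifted = pearson_r(shifted_rank, shifted_points)

print(r, r_shifted)
```

Both printed values agree up to floating-point error, which is the location and scale invariance at work. On this sample r is strongly negative but not exactly -1, because the relationship between rank and points is monotonic without being perfectly linear. A negative scale factor on one variable would flip the sign of r while leaving its magnitude unchanged.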
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
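The pair-counting formula as stated is the variant usually called tau-a. A minimal sketch follows, with the function name `kendall_tau_a` chosen for illustration. Library implementations often default to tau-b, which adjusts the denominator for ties, so their results can differ from this sketch when the data contain tied values.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 means a tie in x or y; it counts toward neither group
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs


# Small hand-checkable case: 5 concordant pairs, 1 discordant pair, 6 pairs in total.
tau_small = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])

# home_team_fifa_rank vs. home_team_total_fifa_points from the ten "Last rows".
fifa_rank = [27, 59, 48, 94, 43, 180, 192, 28, 23, 29]
fifa_points = [1535, 1388, 1446, 1229, 1461, 932, 895, 1526, 1553, 1519]
tau_fifa = kendall_tau_a(fifa_rank, fifa_points)

print(tau_small, tau_fifa)
```

For the small case τ is (5 - 1) / 6, about 0.667. For the ten sample rows every one of the 45 pairs is discordant, so τ is -1.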
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. This report uses the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for it.
First rows
| | date | home_team | away_team | home_team_continent | away_team_continent | home_team_fifa_rank | away_team_fifa_rank | home_team_total_fifa_points | away_team_total_fifa_points | home_team_score | away_team_score | tournament | city | country | neutral_location | shoot_out | home_team_result | home_team_goalkeeper_score | away_team_goalkeeper_score | home_team_mean_defense_score | home_team_mean_offense_score | home_team_mean_midfield_score | away_team_mean_defense_score | away_team_mean_offense_score | away_team_mean_midfield_score |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1993-08-08 | Bolivia | Uruguay | South America | South America | 59 | 22 | 0 | 0 | 3 | 1 | FIFA World Cup qualification | La Paz | Bolivia | False | No | Win | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | 1993-08-08 | Brazil | Mexico | South America | North America | 8 | 14 | 0 | 0 | 1 | 1 | Friendly | Maceió | Brazil | False | No | Draw | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2 | 1993-08-08 | Ecuador | Venezuela | South America | South America | 35 | 94 | 0 | 0 | 5 | 0 | FIFA World Cup qualification | Quito | Ecuador | False | No | Win | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3 | 1993-08-08 | Guinea | Sierra Leone | Africa | Africa | 65 | 86 | 0 | 0 | 1 | 0 | Friendly | Conakry | Guinea | False | No | Win | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 4 | 1993-08-08 | Paraguay | Argentina | South America | South America | 67 | 5 | 0 | 0 | 1 | 3 | FIFA World Cup qualification | Asunción | Paraguay | False | No | Lose | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 5 | 1993-08-08 | Peru | Colombia | South America | South America | 70 | 19 | 0 | 0 | 0 | 1 | FIFA World Cup qualification | Lima | Peru | False | No | Lose | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6 | 1993-08-08 | Zimbabwe | Eswatini | Africa | Africa | 50 | 102 | 0 | 0 | 2 | 0 | Friendly | Harare | Zimbabwe | False | No | Win | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 7 | 1993-08-09 | Guinea | Sierra Leone | Africa | Africa | 65 | 86 | 0 | 0 | 4 | 0 | Friendly | Conakry | Guinea | False | No | Win | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 8 | 1993-08-11 | Faroe Islands | Norway | Europe | Europe | 111 | 9 | 0 | 0 | 0 | 7 | Friendly | Toftir | Faroe Islands | False | No | Lose | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9 | 1993-08-11 | Sweden | Switzerland | Europe | Europe | 4 | 3 | 0 | 0 | 1 | 2 | Friendly | Borås | Sweden | False | No | Lose | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
Last rows
| | date | home_team | away_team | home_team_continent | away_team_continent | home_team_fifa_rank | away_team_fifa_rank | home_team_total_fifa_points | away_team_total_fifa_points | home_team_score | away_team_score | tournament | city | country | neutral_location | shoot_out | home_team_result | home_team_goalkeeper_score | away_team_goalkeeper_score | home_team_mean_defense_score | home_team_mean_offense_score | home_team_mean_midfield_score | away_team_mean_defense_score | away_team_mean_offense_score | away_team_mean_midfield_score |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 23911 | 2022-06-14 | Ukraine | Republic of Ireland | Europe | Europe | 27 | 47 | 1535 | 1449 | 1 | 1 | UEFA Nations League | Łódź | Poland | True | No | Draw | 75.0 | 75.0 | 74.8 | 78.7 | 80.0 | 76.5 | 72.7 | 73.8 |
| 23912 | 2022-06-14 | Bosnia and Herzegovina | Finland | Europe | Europe | 59 | 57 | 1388 | 1406 | 3 | 2 | UEFA Nations League | Zenica | Bosnia and Herzegovina | False | No | Win | 76.0 | 83.0 | 74.2 | 77.0 | 78.0 | 70.0 | 72.3 | 73.5 |
| 23913 | 2022-06-14 | Romania | Montenegro | Europe | Europe | 48 | 70 | 1446 | 1342 | 0 | 3 | UEFA Nations League | Bucharest | Romania | False | No | Lose | 77.0 | 65.0 | 73.5 | 73.7 | 75.0 | 76.2 | 74.7 | 68.2 |
| 23914 | 2022-06-14 | Luxembourg | Faroe Islands | Europe | Europe | 94 | 124 | 1229 | 1137 | 2 | 2 | UEFA Nations League | Luxembourg | Luxembourg | False | No | Draw | 69.0 | NaN | 68.5 | NaN | 69.8 | NaN | NaN | NaN |
| 23915 | 2022-06-14 | Turkey | Lithuania | Europe | Europe | 43 | 138 | 1461 | 1092 | 2 | 0 | UEFA Nations League | İzmir | Turkey | False | No | Win | 79.0 | 71.0 | 78.2 | 76.7 | 78.2 | NaN | NaN | NaN |
| 23916 | 2022-06-14 | Moldova | Andorra | Europe | Europe | 180 | 153 | 932 | 1040 | 2 | 1 | UEFA Nations League | Chișinău | Moldova | False | No | Win | 65.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 23917 | 2022-06-14 | Liechtenstein | Latvia | Europe | Europe | 192 | 135 | 895 | 1105 | 0 | 2 | UEFA Nations League | Vaduz | Liechtenstein | False | No | Lose | NaN | 65.0 | NaN | NaN | NaN | NaN | NaN | NaN |
| 23918 | 2022-06-14 | Chile | Ghana | South America | Africa | 28 | 60 | 1526 | 1387 | 0 | 0 | Kirin Cup | Suita | Japan | True | Yes | Lose | 79.0 | 74.0 | 75.5 | 76.7 | 78.2 | 75.5 | 76.0 | 78.2 |
| 23919 | 2022-06-14 | Japan | Tunisia | Asia | Africa | 23 | 35 | 1553 | 1499 | 0 | 3 | Kirin Cup | Suita | Japan | False | No | Lose | 73.0 | NaN | 75.2 | 75.0 | 77.5 | 70.8 | 72.3 | 74.0 |
| 23920 | 2022-06-14 | Korea Republic | Egypt | Asia | Africa | 29 | 32 | 1519 | 1500 | 4 | 1 | Friendly | Seoul | Korea Republic | False | No | Win | 75.0 | NaN | 73.0 | 80.0 | 73.8 | NaN | 79.3 | 70.8 |